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Abstract 

We prove that the number of copies of any given permutation pat- 
tern q has an asymptotically normal distribution in random permuta- 
tions. 



1 Introduction 

The classic definition of pattern avoidance for permutations is as follows. 
Let p = P1P2 ■ ■ - Pn be a permutation, let k < n, and let q = qiq2 - ■ ■ qu be 
another permutation. We say that p contains g as a pattern if there exists 
a subsequence 1 < ii < i2 <■■■< ik n so that for all indices j and r, the 
inequality qj < qr holds if and only if the inequality pi^ < pi^ holds. If p 
does not contain q, then we say that p avoids q. In other words, p contains 
q ii p has a subsequence of entries, not necessarily in consecutive positions, 
which relate to each other the same way as the entries of q do. 

In a recent survey paper [2] on the monotone permutation pattern 12 - ■ ■ k, 
we have shown that if X„ is the random variable counting copies of that 
pattern in a randomly selected permutation of length n, then as n goes to 
infinity, Xn converges (in distribution) to a normal distribution. When we 
say "random permutation" , we mean that each permutation of length n is 
selected with probability 1/n!. 

In this paper, we will generalize that result for any permutation pattern 
q, and the variable -'^n.q counting the copies of q in permutations of length 
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n. The proof is very similar to the monotone case; just some details have 
to be modified. The result is a far-reaching generalization of the classic 
results (see [3]) for more references) that descents and inversions of random 
permutations are asymptotically normal. As a byproduct, we will see how 
close Var(X„ q) and Var(X„ i2...k) are to each other, for any pattern q of 
length k. 

2 The Proof of Our Theorem 
2.1 Background and Definitions 

We need to introduce some notation for transforms of the random variable 
Z. Let Z = Z - E{Z), let Z = Zj^Nwi^Z), and let Z„ iV(0, 1) mean 
that Zn converges in distribution to the standard normal variable. 

Definition 1 Let {Yn.k\k. = 1, 2, • • • , Nn\ be an array of random variables. 
We say that a graph G is a dependency graph for {Yn^k\k = 1, 2 • • • , N^} if 
the following two conditions are satisfied: 

1. There exists a bisection between the random variables Yn^k o-nd the 
vertices of G, and 

2. If Vi and V2 are two disjoint sets of vertices of G so that no edge of G 
has one endpoint in Vi and another one in V2, then the corresponding 
sets of random variables are independent. 

Note that the dependency graph of a family of variables is not unique. 
Indeed if G is a dependency graph for a family and G is not a complete 
graph, then we can get other dependency graphs for the family by simply 
adding new edges to G. 

Now we are in position to state Janson's theorem, the famous Janson 
dependency criterion. 

Theorem 1 /^/ Let Yn,k be an array of random variables such that for all 
n, and for all k = 1,2, - • • ,Nn, the inequality ^ holds for some 

real number An, and that the maximum degree of a dependency graph of 
{Yn,k\k = 1,2,- ■■ ,Nn} is An. 

Set Yn = Ek=iYn,k and al = Var{Yn). If there is a natural number m 
so that 

iv„Ar^(^)"^o, (1) 
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as n goes to infinity, then 

Yn^N (0,1). 

2.2 Verifying the Conditions of Janson's Criterion 

Let g be a fixed pattern of length k. As q is fixed for tlie rest of this paper, 
we will mark our variables X„ instead of X„ q, in order to avoid excessive 
indexing. 

Let us order the (^) subwords of length k of the permutation piP2 ■ ■ - Pn 
linearly in some way. For 1 < i < (^), let Xn^i be the indicator random 
variable of the event that in a randomly selected permutation of length n, 
the ith subword of length k in the permutation p = pip2 ■ • • Pn is a g-pattern. 
We will now verify that the family of the Xn^i satisfies all conditions of the 
Janson Dependency Criterion. 

First, \Xn^i\ < 1 for all i and all n, since the Xn^i are indicator random 
variables. So we can set An = 1. Second, Nn = (^), the total number of 
subwords of length k in p. Third, if a 7^ 6, then Xa and Xj, are independent 
unless the corresponding subwords intersect. For that, the 6th subword 
must intersect the ath subword in j entries, for some i < j < k — 1. For a 
fixed ath subword, the number of ways that can happen is l^j=i (j) (fclj) = 

(fe) ~ {"^~k^) ~ where we used the well-known Vandermonde identity to 
compute the sum. Therefore, 

/ n\ / n — k\ 

In particular, note that ^ provides an upper bound for A„ in terms of a 
polynomial function of n that is of degree k — 1 since terms of degree k will 
cancel. 

There remains the task of finding a lower bound for an that we can 

then use in applying Theorem [H Let Xn = ^i=i^n,i- We will show the 
following. 

Proposition 1 There exists a positive constant c so that for all n, the in- 
equality 

Var{Xn) > cn^^-^ 

holds. 
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Proof: By linearity of expectation, we have 



Var(X„) 



E{Xl) - {E{X^)f 
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Let Ii (resp. I2) denote the /c-element subword of p indexed by ii, (resp. 
«2)- Clearly, it suffices to show that 



E{Xn,iiXn,i2) — E{Xn,ii)E{Xn,i2) > 

\hnh\<i 



cn 



2k~l 



(7) 



since the left-hand side of d?]) is obtained from the ^ by removing the sum of 
some positive terms, that is, the sum of all E(Xn^ij^Xn^i2) where |/i n/2| > 1- 
As E{Xn^i) = l/k\ for each i, the sum with negative sign in ^ is 



E{Xn^i-^)E{Xn,'. 



Il,l2 



n 



1 

kj W 



which is a polynomial function in n, of degree 2k and of leading coefficient 
-j^. As far as the summands in with a positive sign go, most of them 



are also equal to More precisely, E{Xn,i-^Xn,i2) = ^ when Ii and I2 
are disjoint, and that happens for (^) ("^'^) ordered pairs (11,12) of indices. 
The sum of these summands is 



dr. 



n\ / n — k\ 1 



k k9' 



(8) 



which is again a polynomial function in n, of degree 2k and with leading 
coefficient So summands of degree 2k will cancel out in ([6]). (We will 
see in the next paragraph that the summands we have not yet considered 
add up to a polynomial of degree 2k — 1.) 
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In fact, considering the two types of summands we studied in ([6]) and 
([8]), we see that they add up to 



n 

^kj\ k y A;!2 ^k J kP k\^ 

= n''-'^ + 0{n''-'). (10) 

Next we look at ordered pairs of indices (ii, ^2) so that the corresponding 
subwords Ii and I2 intersect in exactly one entry, the entry x. Let us restrict 
our attention to the special case when Ii and I2 both form g-patterns, and 
X is the ath smaUest entry in Ii and the 6th smallest entry in 12- Given 
q, the pair (a, b) describes the location of x in Ii and in I2 as well. Let /( 
(resp. I2) denote the set of a — 1 positions in Ii (resp. 6—1 positions in I2) 
which must contain entries smaller than x given that Ii (resp. I2) forms a 
(/-pattern. Similarly, let /(' (resp. I2) denote the set of A; — a positions in 
Ii (resp. k — b positions I2) which must contain entries larger than x given 
that Ii (resp. I2) forms a (7-pattern. 

Example 1 Let q = 35142, and let us say that Ii and I2 both form q- 
patterns, and they intersect in one entry x that is the third smallest entry 
in Ii and the fourth smallest entry in I2 (so a = 3, and b = A). Then x is 
the leftmost entry of Ii and the next-to-last entry of l2- Furthermore, the 
third and fifth positions of Ii form I[ and the second and fourth positions of 
Ii form I" . Similarly, the first, third, and fifth positions of I2 form I2 and 
the second position of I2 forms I2. 

Let qa (resp. q^) be the pattern obtained from q by removing its ath 
smallest (resp. 6th smallest) entry. 

Note that Xi-^Xi^ = 1 if and only if all of the following independent 
events hold. 

1. In the {2k — l)-element set of entries that belong to I1UI2, the entry x 
is the (a + 6— l)th smallest. This happens with probability l/{2k — 1). 

2. The a + 6 — 2 entries in positions belonging to /{ U I2 must all be 
smaller than the 2k — a — b entries in positions belonging to /(' U I2. 
This happens with probability j^^i:t\- 



V: 



+b- 



3. • the subword /( is a pattern that is isomorphic to the pattern 
formed by the o — 1 smallest entries of q, 



5 



• the subword I2 is a pattern that is isomorphic to the pattern 
formed by the 6—1 smallest entries of q, 

• the subword /(' is a pattern that is isomorphic to the pattern 
formed by the k — a largest entries of q, and 

• the subword I2 is a pattern that is isomorphic to the pattern 
formed by the k — b largest entries of q. This happens with prob- 
ability {a-iy.{b-l)l{k-a)\(k-b)\- 



Therefore, if |/i fl /2I = !> then 

1 

(2^ - l)(a+V-2)(« - m - mk - a)\{k - by. 



P{Xi,X,, = l) = — ^^,2k-2^, — ^(11) 



1 fa + b-2\f2k-a- b\ 

(12) 



{2k -ly V «-i 

How many such ordered pairs (/i, I2) are there? There are {21^-1) choices 
for the underlying set /1U/2. Once that choice is made, the 0+6— 1st smallest 
entry of Ii U I2 will be x. Then the number of choices for the set of entries 
other than x that will be part of Ii is ("^^^^) i^^j^^a^) ■ Therefore, summing 
over all a and b and recalling (jlip . 

Pn = Yl P{XnXi, = l)= E{X,,Xi,) (13) 

|/in/2|=i |/in/2|=i 

1 / n \ /a + 6 - 2\ ^ /2A; - a - 6^ ^ 



(2A;-1)!V2A; 

The expression we just obtained is a polynomial of degree 2k — 1, in the 
variable n. We claim that its leading coefficient is larger than k"^ /k\^. If 
we can show that, the proposition will be proved since (llOp shows that the 
summands not included in (|13|) contribute about ~-fpr^^'^~^ to the left-hand 
side of dZ]). 

Recall that by the Cauchy-Schwarz inequality, \i ti,t2, - ■ ■ ,tm are non- 
negative real numbers, then 

m ^-^ 

i=l 

where equality holds if and only if all the ti are equal. 
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Let us apply this inequality with the numbers ("^^x^) C\-a ^) P^^Y^^S 
the role of the tj, where a and b range from 1 to A;. We get that 



E 

l<a,fe<A: 



> 



r, ') I \^ (a+b-2\ (2k-a-b 

a + b-2\ /2k-a-b\ {2^i<a,b<k\ a-i )\ k^a 



a — 1 J \ k — a J k"^ 



(16) 

We will use Vandermonde's identity to compute the right-hand side. To 
that end, we first compute the sum of summands with a fixed h = a + h. We 
obtain 

l<a,fe<fc ^ ^ ^ ^ h=2a=l ^ ^ ^ ^ 

^ ec::;) 

h=2 ^ ^ 

Substituting the last expression into the right-hand side of (I16p yields 

l<a,6<fc V / V / V / 

Therefore, ^ and ([20]) imply that 

1 / n \{2k-\f (2k-t\^ 
^ (2fc- 1)! V2A; - ly P V A: - 1 y ' 

As we pointed out after (fT3|) . p„ is a polynomial of degree 2/c — 1 in the 
variable n. The last displayed inequality shows that its leading coefficient is 
larger than 

1 1 (2A;-2)!2 _ A;2 

(2k - 1)!2 ■ fc2 ■ (A: - 1)14 - ;fc!4 

as claimed. 

Comparing this with (jlOp completes the proof of our Proposition. O 



We can now return to the application of Theorem [T] to our variables X^^,. 
By Proposition [H there is an absolute constant C so that (7„ > Cn^~^'^ for 
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all n. So ([T]) will be satisfied if we show that there exists a positive integer 
m so that 

Clearly, any positive integer m is a good choice. So we have proved the 
following theorem. 

Theorem 2 Let q be a fixed permutation pattern of length k, and let Xn he 
the random variable counting occurrences of q in permutations of length n. 
Then Xn — > -/V(0, 1). In other words, Xn is asymptotically normal. 

The following Corollary shows how close the variances of the numbers of 
copies of two given patterns are to each other. 

Corollary 1 For any pattern q of length k, we have 

Var{Xn,c,) = Ckn'^-^ + 0{n^^-^), 

where 

1 ^ fa + h-2\^ f2k- a-b\^ k"^ 

~ (2k - 1)!2 ^ ia_iM k-a ) ~W 

l<a,b<k ^ ^ ^ ^ 

We point out that this does not mean that V ar{Xn^c^ does not depend on 
q. It does, and it is easy to verify that Var(X4^i23) 7^ Var(X4^i32). However, 
it is only the terms of degree at most 2/c — 2 of Var(X„^q) that depend on q. 
Proof: Note that in the proof of Theorem [21 we have not used anything 
about the pattern q apart from its length. Our claim then follows from 
comparing ([TO]) and ([HI). ^ 
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